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HUMAN ENDOGENOUS RETROVIRUS IN BREAST CANCER 

BACKGROUND OF THE INVENTION 

The present application takes priority from Provisional U.S. patent application number 
60/141,626 filed on June 30, 1999 the entire text of which is specifically incorporated herein by 
reference. 

1 . Field of the Invention 

The present invention relates to compositions and methods for the detection, prognostic 
evaluation, and treatment of oncogenic disorders, particularly breast cancer. 

Specifically, the instant invention provides for compositions, useful for identifying and 
treating disorders related to newly identified endogenous retroviruses which are present in a 
subset of humans, cats, and nonhuman primates. 

2, Technical Problem Addressed by the Invention 

Mutations in known susceptibility genes do not account for all breast cancer 

Breast cancer (BC) is one of the leading causes of cancer death among women. The 
induction of BC is thought to involve the interplay of several factors, including the genetic, 
hormonal, immunological and physiological status of the host, as well as dietary habits and 
exposures to chemicals, radiation or infectious agents. It is now clear - that variations of several 
genes, including BRCA-1 and BRCA-2, can result in greatly increased risks for development of 
BC. However, defects in known BC susceptibility genes account for only about 5% of BC, the 
so-called familial cases (Armstrong et a'L, 2000: Gayther et al, 1 998). 

As with other types of cancer, the possibility that a vims is etioiogically involved in 
sporadically occurring BC has not been eliminated. Consequently, there has long been a need to 
determine which, if any viruses are causally linked to the development of BC. The identification 
of such a vims would likely provide invaluable aid in the following areas BC medicine: 
prevention, diagnosis, determination of prognosis, and treatment. 
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3. Description of Related art 
Retrovirus induction of breast cancer lis mice 

Mouse mammary tumor virus (MMTV), a B-type retrovirus, was discovered during 
studies of hereditary cancer in mice at the Jackson Laboratories in the 1 930's (Bittner, 1936). As 
the prototype of slow-transforming retroviruses, MMTV has been definitively shown to cause 
BC in mice. Prior studies established that MMTV is transmitted both in the germline as 
endogenous proviruses and exogenous!}' via milk. As endogenous elements MMTV proviruses 
follow patterns of Mendel ian inheritance, as other sequences in the genome (Cohen et al, 
Ce//,1979; Cohen and Varmus, 1979, 1980; Traina et al, 1981; Traina-Dorge and Cohen, 1983; 
Traina-Dorge el al, 1985; Varmus et al, 1978). Horizontal transmission of MMTV typic lib 
occurs by infection of mouse pups by MMTV virions present in the milk of infected dams. 
Thus, it is possible to transmit MMTV to mice by foster feeding. 30 or more unique proviral 
integration sites for endogenous MMTV have been identified. However, some wild mice do not 
carry any endogenous MMTV proviruses (Cohen and Varmus, 1979; Cohen et al, 1982). This 
result suggests that the many endogenous MMTV proviruses are relatively recent additions to the 
mouse genome. The most likely explanation is that MMTV entered the germline of certain mice 
(but not others) on multiple occasions after the evolutionary splits among the various species and 
subspecies of the genus Mm. Certain endogenous MMTV can be activated by hormones to form 
infectious virions capable of inducing mammary carcinomas after long latency periods. Most 
endogenous MMTV proviruses are defective and do not encode for infectious virions. 

Roles of MMTV genes and cellular genes in oncogenesis 

The MMTV Orf protein can function as a superantigen (SA). When expressed in the 
thymus during fetal/early development it can mediate complete or incomplete deletion of SA- 
reactive T-cells. SA expression is required to activate B-cells targets of MMTV in the 
gut-associated lymphoid tissue of nursing pups. Complete deletion of the SA responsive clones 
thus renders the mice resistant to MMTV infection in the gut and thereby leads to a low- 
incidence of MMTV-induced tumors. On the other hand, in mice with only partially deleted 
responsive clones of lymphoid ceils the SA activation stimulates expansion of the targets arid 
spread of MMTV. As female infected animals reach puberty, estrogenic hormones drive 
expression of the MMTV long terminal repeat (LTR) through its hormone response element 
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(HRE). This permits production and assembly of MMTV and spread of the vims to other 
hormonal ly-sensiti ve tissues, including the breast and ovaries. Integration of MMTV LTRs 
adjacent to certain cellular genes, such as the proto-oncogenes Int, Writ and Fgf, can increase 
expression of these genes resulting in BC and other cancers. 

The molecular genetic interactions between MMTV, the immune system of its murine 
host, and the breast and other hormonal ly-sensitive cells malignantly transformed by this 
retrovirus have been extensively studied. MMTV promotes mammary gland cancer in mice by 
insertional mutagenesis (Varmus et al, 1978; Varmus, 1985). MMTV proviral LTR elements 
direct steroid hormone-dependent transactivation of various cellular oncogenes including Writ, 
Fgf and Int thereby promoting clonal expansion of tumor cells (Shackleford and Varmus, 1987, 
Shackleford et al, .1993: Jakobovits et al, 1986; Nusse, 1991; Nusse et al ,1985). For 
productive, persistent infection and completion of its replication cycle, MMTV must contain a 
superantigen and interact with a functional host immune system (Golovkina et al, 1995; Luther 
and Acha-Orbea, 1996; Coffin, 1992). 

The search for a human breast cancer virus 

The discovery of the oncogenic MM TV has prompted many investigators to explore a 
retroviral etiology for BC in humans (Sarkar, 1980). Data collected over the past five decades 
has suggested the existence of a human homoJogue of MMTV. In 1971, Moore and associates 
reported that 60% of human milk samples from BC patients contain B-type particles 
indistinguishable from MMTV by electron microscopy, compared to 5% of the general 
population (Moore et al., 1971). These investigators also reported that 39% of Parsi women of 
India, an inbred population with a two-fold increased incidence of BC, had B-type particles in 
their milk (Das et al, 1972: Moore, 1971}. Several studies have demonstrated that BC cells, but 
not ceils from normal tissues, also contain reverse transcriptase (RT), an enzyme associated with 
all retroviruses. Numerous investigators have examined serum and breast milk for the presence 
of antibodies reactive with MMTV. Most of these studies were performed in the pre-AIDS era, 
prior to the advent of highly sensitive and specific techniques for detecting anti-retroviral 
die tta necess y for detection of HIV antibodies in donated blood. 
Despite the numerous electron microscopic, biochemical and immunological studies on 
human breast carcinoma tissue, milk, patients' sera, and breast carcinoma cell lines suggesting 
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the existence of a human homologue of MMTV, proof that such an agent exists has remained 
elusive (Andersson et al, 1996; Ziegier, 1997). Most authors have dismissed the importance of 
prior studies purporting to show evidence of a human homologue of MMTV because of the 
presence of numerous human endogenous retroviruses (HERVs) (Larsson et al, 1994; Li el al, 
1 996; Lower et al , 1 996; Meese et al , 1 996: Ono, 1 986; Patience et al. , 1 996; Faff et al. , 1 992). 
There are about 50,000 HERVs or HERV-reiated sequences in the human genome, some of 
which have been shown to have up to 60% homology to MMTV, In this regard, it is important 
to note that seroreactivity to HERV-K10. to this point the HERV considered to he most closely 
related to MMTV, cannot account for MMTV -reactive antibodies present in the sera of breast 
cancer patients and the smaller number of healthy individuals (Vogetseder e> al. . 1995). 
Furthermore, we believe that the presence of these MMTV-related sequences is precisely the 
reason that human homologues of MMTV have not previously been demonstrated conclusively 
by molecular techniques. The presence of these relaied, but distinct, sequences could have 
obscured the detection of more closely related sequences by prior investigators who used less 
sensitive techniques, such as Southern blotting. 

Only recently have sequences with relatively high homology (>90%) to those of MMTV 
been isolated from human BC tissue (Wang et al, 1995, 1998: ). Sequences 95 - 99% similar to 
MMTV env were amplified by PGR in 121 (38.5%) of 314 unselected breast cancer tumor 
samples. It is pertinent to note that the MMTV-like sequences were detected in only 2 (1.8%) of 
107 breast specimens from reduction mammoplasties and in 0/80 samples from normal tissues or 
non-breast tumors. The MMTV-env like RNA was expressed (as determined by RT-PCR) in 
66% of DMA PGR positive breast tumors (Wang et al, 1998). A complete 9.9 kb pro virus with 
94% similarity to MMTV was detected in 2 breast tumors. FISH (fluorescence in situ 
hybridization) revealed integration at several sites in DNA derived from BC tumors, but not 
normal breast cells (Wang et al, 1999 ACR mtg. abstracts #2933, 2944). Wang et al. suggested 
the existence of a human mammary tumor virus (HMTV) that is spread by the exogenous route 
of infection (horizontal transmission). Attempts by these and other investigators to amplify other 
regions of MMTV-related viruses, from the genomic DNA or cDNA of subjects who did not 
have BC, yielded HERV sequences (such as HERV -K 10) with only about 60% homology to 
MMTV. Thus, BC tissues are the only tissues in which sequences that are highly similar to those 
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of MMTV have been heretofore found. Consequently normal breast tissue and other tissues 
appeared to he negative for the expression of viruses with high homology to MMTV. 

MMTV can foe transmitted horizontally or vertically 

The retrovirus replication cycle is characterized by conversion of the single-stranded 
RNA viral genome into doubled-stranded proviral DNA by the multiple enzymatic activities of 
the virion-assoeiated reverse transciptase (RT). As is the case with other retroviruses, 
integration of MMTV proviral DNA into the genome of host cells is required for expression of 
viral proteins and production of infectious progeny. In both infected mouse mammary glands as 
well as heterologous cells MMTV proviral DNA is integrated into a large number of apparently 
random sites. Integration of MMTV proviruses containing transcriptionally active LTR near 
some cellular genes (proto-oncogenes), such as Int, Writ, and Fgfcm result in over-expression of 
these genes, cellular transformation and clonal expansion of the tumor cells (Varmus, 1985; 
Shackieford et al., 1993; Jakobovits et al, 1986; Cohen, 1980; Breznik and Cohen, 1982). The 
long latency of MMTV-induce carcinogenesis is explained in part by the necessity for proviruses 
to integrate into these particular sites. 

Genetic differences among viral strains of MMTV can account in part for the varying 
incidence of BC in diverse strains of mice. Mice of the C3H strain have a greater than 90% 
incidence of BC, compared to a <1% incidence of BC in BALB/e mice. BALB/c mice foster- 
nursed on C3H females have a high incidence of BC which suggested that the tumorigenic 
MMTV of C3H can be horizontally transmitted in the milk (Bittner. 1936). Conversely, when 
C3H mice are foster nursed on a BALB/c female the incidence of BC is significantly lower (22- 
55%), but not as low as low-incidence mouse strains. This latter observation underscores the 
importance of the horizontally-transmitted milk-born virus in the high incidence strains, but also 
indicates the substantial differences in tumorigenic potential among endogenous MMTV 
proviruses. Proviruses of various mice with high and low tumor incidence could be 
distinguished by differences in solution hybridization kinetics and restriction endonuc lease 
digestion patterns (Cohen et al. Cell, 1979; Cohen and Varmus, 1979, 1980: Traina-Dorge and 
Cohen, 1983; Breznik et al. 1984). Using restriction enzymes that differentiate between 
hypomefhylated and methylated DNA, proviruses of milk-borne MMTV were also shown to be 
hypornethylated, whereas most endogenous proviruses contain abundant 5-methyieytosme 
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(Cohen, 1980; Breznik and Cohen, 1982; Breznik et al, 1984). Because hypomethylation is 
associated with increased gene expression, this observation could explain the importance of 
horizontally-transmitted milk-home MMTV in mouse strains with a high incidence of BC. 
Specific hypomethylation of an endogenous MMTV provims was associated with expression of 
a 1.6 kb transcript of the lactating mammary gland of the BALB/c mouse (Traina-Dorge et al, 
1985). 

In other studies, endogenous MMTV proviruses have been shown to segregate as stable 
genetic units during inbreeding and that certain of these endogenous proviruses are 
transcriptionally active (Cohen et al, CW/,1979; Cohen and Varmus, 1979, 1980; Traina et al, 
1981; Traina-Dorge and Cohen, 1983; Traina-Dorge et al, 1985; Varmus et al, 1978). 
Recombinant inbred (RI) strains of mice have been used to define MMTV provims composition 
and chromosomal locations (Traina et al, 1981; Traina-Dorge and Cohen, 1983). RI strains 
were developed by crossing mice from two highly inbred lines, randomly mating the F2 
generation brothers and sisters and maintaining each as separate lines. Many genetic loci were 
mapped to specific chromosomes in these strains. By analyzing the segregation patterns of 
various MMTV proviruses with these genetic markers by restriction endonuclease analysis and 
Southern blotting, it was possible to determine the chromosomal locations of a number of 
MMTV proviruses in these strains and to establish that these provirus segregate by the rule of 
Mendelian inheritance. These and other analyses defined 10 distinct MMTV units (proviruses) 
in these animals some full length and some truncated. The ratios of inheritance for most of the 
MMTV proviruses were consistent with simple single-gene inheritance, though three of the 
MMTV units demonstrated some variance. Most noticeable was the presence of a unit 
contributed by one parent that was present in 23 of the 26 RI strains analyzed. Specific proviral 
units were identified that were transcriptionally active and associated with increased tumor- 
production (Traina-Dorge et al, 1985). Importantly, cellular genes were identified that were 
significantly associated with virus expression showing the involvement of host genetics in 
disease progression (Traina-Dorge et al, 1985). 

Several lines of evidence indicate that the endogenous MMTV proviruses have integrated 
relatively recently in the germline of various strains of mice (Cohen and Varmus, 1979; Varmus 
et al, 1978). If MMTV evolved from elements present in a progenitor of Mus museums (the 
laboratory mouse), it would be expected that all individual mice would have similar MMTV 
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provirases. However, both laboratory mice and wild mice trapped at several locations having 
quite variable numbers and distributions of germline integration sites for their MMTV pro viruses 
(Cohen and Varmus, 1979). The most striking finding among these results were the presence of 
some wild caught animals that contained no endogenous MMTV proviruses. The most likely 
interpretation of these results is that the endogenous MMTV proviruses arose by multiple 
independent integrations into the DNA of germinal ceils after speciation of the genus Mus, rather 
than arising from genetic elements present in the evolutionary progenitors of mice. 

SUMMARY OF THE INVENTION 

The present invention provides tor recombinant DNA molecules derived from one or 
more mammary tumor viruses which are endogenous retroviruses with homology to the 
sequences of MMTV. Specifically, the sequences of the instant invention have at least 99% 
identity with all or a portion of SEQ ID NOs: 2, 3, 4, 5, 6, 7, 8, 21, 23, 25, 27, or 29 or at least 
92% identity with SEQ ID NO: 1 0. 

Additionally, the present invention provides for the RNA molecules produced by 
transcription of the DNA described above. Furthermore it provides for the polypeptides resulting 
from the in-frame translation of RNA which has been transcribed from the MTV DNA of the 
instant invention. According to the present invention, the referenced DNA sequences may be 
derived from any suitable source, such sources may include, but are not limited to, human, cat, 
and rhesus macaque. 

The present invention also provides for a recombinant DNA plasmid (a vector) which 
comprises mammary tumor virus (MTV) DNA the sequences of SEQ ID NOs: 2, 3, 4, 5. 6, 7, 8, 
10, 21, 23, 25, 27, or 29, or sequences which have at least 99% identity thereto. That is said 
DNA sequence is incorporated in the vector. 

In one embodiment of the present invention the vectors further comprise a heterologous 
promoter operably linked to the MTV sequence (i.e. joined in the proper reading frame so as to 
he capable of producing functional MTV RNA and/or protein in vivo and in vitro). In one aspect 
of this embodiment of the invention the vector, which comprises the MTV DNA, is capable of 
episomal replication or chromosomal integration in at least one of the following ceil types: 
bacterial cells, yeast cells, insect cells , avian, cells, and mammalian ceils (this list of cell types is 
representative and should not be considered exhaustive). In another aspect, of this embodiment 
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of the invention, the heterologous promoter provides for the expression of the MTV DNA 
sequence in one or more cell types. Cell types considered useful as pail of this aspect of the 
invention include, but are not limited to the following: bacterial cells, yeast ceils, insect cells , 
avian, cells, and mammalian cells. 

According to another embodiment of the instant invention the MTV DNA sequences 
described above may be used to provide a method of detecting the presence of MTV DNA or 
RNA ^ " v. s i s s v i , uiheiwisei 

Another embodiment of the instant invention provides for a method of determining 
whether a sample contains antibodies which recognize proteins derived from the MTV DNA 
sequences described above (e.g polypeptide sequences derived from transcription and translation 
of SEQ ID NOs: : 2, 3, 4, 5, 6, 7, 8. 10, 21, 23, 25, 27, or 29). As a corollary to this aspect, the 
instant invention also provides for antibodies which specifically detect one or more of the 
polypeptides of the instant invention, 

Another embodiment of the invention provides for diagnostic kits useful for detecting 
DNA, RNA, or polypeptides, from a mammary tumor virus, in a biological or oilier type of 
sample. 

Another embodiment of the instant invention provides for methods of attenuating or 
eliminating the activity of MTV in its host animal. Various aspects of this embodiment provide 
for pharmaceutical compositions comprising substances which disrupt the activity of the MTV 
reverse transcriptase, protease, or integrase enzymes. Other aspects of this embodiment provide 
for pharmaceutical compositions capable of eliciting an immune response in a host animal, 

BRIEF DESCRIPTION OF THE DRAWINGS 

The following drawings form part of the present specification and are included to further 
demonstrate certain aspects of the present invention. The invention may be better understood by 
reference to one or more of these drawings in combination with the detailed description of 
specific embodiments presented herein. 

Figure 1 shows the results of amplification of sequences related to MMTV from human 
breast cancer tissue. Panel A: DNA was extracted from human breast tumors (kindly provided 
by Michael Press, M.D., USC, Los Angeles or Derrick Beech M.D., TMC/UT Memphis) and 
PGR was performed using primers specific for the human MMTV env-related gene. PGR 
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products were transferred to nitrocellulose by blotting and MMTV-reiated products were 
detected by hybridization to a 1.8 kb MMTV env probe, lane a: nonradioactive markers, not 
shown; lanes b-d, f-h, j: breast tumor DNA; lanes e and i: no DNA: lane k: water control; lane 1 
positive control: MMTV env fragment cloned in pBluescript 1M . Panel B: As a test for the 

5 integrity of the DNA from the clinical samples we amplified HERV-3 proviral DNA, a single 
copy human endogenous retrovirus using PGR conditions developed by Griffiths ei al (1997). 
Ethidium bromide detection (markers are visible in lane a). Same samples as Panel A except 
lane 1: positive control, HER.V3 pal fragment cloned in pBluescript 1 M (Stratagene). Visible 
bands were present in lanes c and g, but do not copy well. 

10 Figure2 shows an example of the amplification of sequences related to MMTV from the 

blood of healthy controls. Panel A: DNA was extracted from whole blood of healthy control 
subjects and PGR was performed using primers specific for the human MMTV env-related gene 
and PGR products were detected by Southern hybridization, lane a: markers; lanes b-k: DNA 
from whole blood of healthy controls; lane 3: water control; lane m positive control: MMT V env 

is fragment in pBluescript™ (Stratagene). Panel B: PGR amplification of HERV-3. Ethidium 
bromide detection (markers are visible in lane a). Same samples as Panel A except lane m: 
positive control, HERV3 pot fragment in pBluescript IM . 

Figure 3 shows an example of the detection of HMTV mRNA by ribonuclease protection 
assay. RNA was extracted from three HMTV PGR positive breast cancer tumors or from yeast 

20 ceils and hybridized to either 1 89 base probe specific for HM TV or a 245 base probe for p-aetin. 
Samples were then digested with RNAse A/IT. The fragments of the labeled probes protected by 
hybridized RNA were visualized and analyzed following separation on a denaturing 
polyacrylamide gel Two of the three tumor samples gave protected fragments of the expected 
size with the HMTV probe (arrow) (lanes 1-3), whereas all three tumor RNAs gave protected 

25 fragments with tire fi-actin probe (lanes (5-7). As expected neither probe was specifically 
protected by yeast RNA (lanes 4, 8), 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention provides for isolated DNA, RNA, and polypeptides derived from 
mammary tumor vims (MTV) which is a virus with high sequence homology to mouse 
30 mammary tumor virus. The present invention further provides for diagnostic methods of using 
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these macromolecules. Additionally the present invention provides pharmaceutical reagent 
compositions and diagnostic kits comprising MTV DNA, RNA, and/or polypeptide, which can 
be used according to the disclosed methods. 

Various embodiments of the instant invention provide for MTV nucleic acid 
corresponding to at least one of the following: DNA sequences with at least 99% identity with 
one of the following: 

a) at least 99% identity with SEQ ID NO: 2, 3, 4, 5, 7, 8, 21, 25, or 27; or 

b) at least 99% identity with one or more of the following: at least 340 contiguous bases 
of SEQ ID NO:2, at least 150 contiguous nucleotides of the sequence represented by 
nucleotides 1-380 of SEQ ID NO:3, at least 140 contiguous nucleotides of the 
sequence represented by nucleotides 1-400 of SEQ ID NO:4, at least 130 contiguous 
nucleotides of the sequence represented by nucleotides 1-360 of SEQ ID NO:5, at. 
least 130 contiguous nucleotides of SEQ ID NO:6, at least 140 contiguous 
nucleotides of the sequence represented by nucleotides 1-380 of SEQ ID NO: 7, at 
least 210 contiguous nucleotides of the sequence represented by nucleotides 20-462 
of SEQ ID NO:8, at least 160 contiguous nucleotides of the sequence represented by 
nucleotides 97-462 of SEQ ID NO:21, at least 170 contiguous nucleotides of SEQ ID 
NO:23, at least 100 contiguous nucleotides of the sequence represented by 
nucleotides 337-462 of SEQ ID NO:25, at least 300 contiguous nucleotides of the 
sequence represented by nucleotides 85-462 of SEQ ID NO:27, or at least 200 
contiguous nucleotides of SEQ ID NO:29; or 

c) at least 92% identity with SEQ ID NO: 10.. 

Preferably the sequences have greater than 99% identity with greater than 200, 250, 300, 
350 or 400 nucleotides SEQ ID NO's: 2, 3, 4, 5, 6, 7, 8, 21, 23, 25, 27, or 29. In another 
preferred embodiment the sequences have greater than 93%, 94%, 95%. 96%. 97%, 98% or 99% 
identity with SEQ ID NO: 10. Even more preferably these nucleic acids are derived from human, 
cat, or rhesus macaque MTV's. Most preferably the DNA is identical with all or part of SEQ ID 
NO's: 2, 3, 4, 5, 6, 7, 8, 10, 21, 23, 25, 27, or 29. 

In one aspect of this embodiment of the invention the MTV DNA sequences described 
above are incorporated in a vector. In various related aspects die invention the MTV sequences 
are under the transcriptional control of an heterologous promoter. The vectors contemplated as 
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being useful according to the instant invention are capable of expressing the MTV DNA 
sequences in at least one of the following cell types: insect cells, bacterial cells, avian cells, 
yeast cells, or mammalian cells. Furthermore the vectors of this aspect of the invention are also 
capable of episornal replication and/or chromosomal integration in at least one of the cell types 
listed. 

In another aspect of this embodiment the DNA sequences comprise one or more detection 
moieties. Detection moieties contemplated as being suitable for the instant invention include, 
but are not limited to fluorescent dyes, and radioactive isotopes of phosphorous, sulfur, oxygen, 
carbon, or hydrogen. 

in yet another aspect of this embodiment of the invention the isolated DNA molecule is 
suspended or dissolved in a diluent compatible with the invention. Suitable diluents do not 
interfere with the use of the DNA according to die various embodiments of the instant invention. 
Diluents useful according to this embodiment of the invention are well known to those skilled in 
the art. They include but are not limited to buffered aqueous solutions. These may be buffered 
with any compound compatible with the present invention. Exemplary buffering agents include 
phosphate buffers and buffers comprising tri shydroxyaminomethane (Tris). Such buffered 
aqueous solutions may further comprise any other compound, such as sodium chloride, which 
will not interfere with the operation of the instant invention. According to this aspect of the 
instant invention the MTV DNA may be present in any suitable concentration. Preferably the 
concentration if from about 0.1 ng/p.1 to 100 jug/ul. 

Other embodiments of the instant invention provide for RNA transcripts and/or 
polypeptides encoded by the nucleic acids described above. In various aspects of this 
embodiment these RNA transcripts or polypeptides are encoded by any of the DNA sequences 
described above. These RN A transcripts and polypeptides are homologues of RNA transcripts 
and polypeptides encoded by the MMTV env, gag, or pol genes, in one aspect of this 
embodiment the RNA transcripts are encoded by the DNA sequences described above. Such 
RNA transcripts have a sequence identical to the disclosed DNA sequence except that the RNA 
contains uridine instead of thymidine. In another preferred aspect of this embodiment the 
purified polypeptide corresponds in sequence with ail or part of an env, gag, or pro, protein 
product from a human, cat, or rhesus macaque mammary tumor virus. It is preferred that the 
peptides be purified polypeptides. In a preferred aspect of the present invention the purified 
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polypep tides are comprised of all or part of the sequence resulting from the in-frame 
transcription and translation of any of the MTV DNA sequences described above. In an even 
more preferred aspect of the present invention the polypeptides correspond in sequence to the 
product of an in-frame translation of SEQ ID NO's: 2, 3, 4, 5, 6, 7, 8, 10, 21, 23, 25, 27, or 29. 
Most preferably the polypeptide sequences of the instant, invention correspond to ail or at least 
80 amino acids of SEQ ID NO's 12, 13, 14, 15, 16, 17, 18, 20, 22, 24, 26, or 28. The term 
"derived from MTV DNA" is meant to convey that the RNA or peptide corresponds in sequence 
to the RNA or peptide produced respectively by the transcription or transcription and in-frame 
translation of MTV DNA. 

By "purified polypeptides" it is meant that the majority (greater than 50%) of the 
polypeptides in the sample are the MTV polypeptides. Preferably, the MTV polypeptides 
constitute greater than 70% of the polypeptide in the purified polypeptide sample. More 
preferably the MTV polypeptide constitutes greater than 90% of the polypeptide in the sample. 
Even more preferably the MTV polypeptide constitutes greater than 95% of the polypeptide in 
the sample. Additionally, the term "purified polypeptide*' indicates that the sample does not 
contain substances which interfere with the operation of the instant invention. Purified 
polypeptides obtained in accordance with the current invention may be from any suitable source 
which is compatible with the instant invention. 

Another embodiment of the instant invention provides for an antibody against an MTV 
polypeptide as described above. Antibodies according to this aspect of the invention are 
generated using one or more of the polypeptides of the instant invention as the antigenic agent. 
Such antibodies may be either monoclonal or polyclonal in nature. Preferably the antibodies are 
monoclonal. Once the MTV has been provided the antibodies of this aspect of the invention may 
be prepared according to methods well known to those skilled in the art. For example, 
polyclonal antibodies are commonly produced by injecting the antigenic agent (in the presence 
of an immune response en! ncinj ij ;nt such as complete Freunds adjuvant) into an animal such 
as a sheep, bovine, equine, goat, or rabbit. 

Monoclonal antibodies according to this aspect of the present invention can be prepared 
by hybridoma fusion techniques or by techniques that utilize Epstein Barr Virus (EBV) - 
immortalization technologies (to produce human mAbs), such as are well known by those of skill 
in the art, modified as described herein. In the method of the invention, these techniques involve 
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the injection of an iramunogen, in this case, purified MTV polypeptide so as to elicit a desired 
immune response in that animal (i.e., production of antibodies). The experimental animal, (e.g., a 
mouse) is given repeated injections (boosts) of the same immortalized cell line. In a final step, 
the animal is given an injection of primary cells of the chosen ceil type. 

In the illustrative example herein for the production of agonist monoclonal antibodies to 
megakaryocytic cells, a CMK cell preparation and a CMS cell preparation were used as the first 
immunogens; however, other immortalized megakaryocytic cells, such as Mo7e or DAMI cells, 
could have been used. Other monoclonal antibodies analogous to the agonist antibody of the 
invention BAH-1, which specifically recognizes the C-Mpl receptor, can be generated using 
membrane bound c-Mpl receptor protein as the immunogen. To generate agonist monoclonal 
antibodies against other cell types, other cells of hemopoietic lineage are chosen, e.g., stem cells, 
B cells or T ceils. In the first immunization step stem cells can be represented, e.g., by the 
immortalized cell line CTS; B cells by the immortalized cell lines ARH-77, SB or Nal-6; and T 
cells by the immortalized ceil lines Jurkat or H9. 

After a sufficient time, the animal is sacrificed and somatic antibody-producing cells may 
be derived from the lymph nodes, spleens and peripheral blood of primed animals. Spleen ceils 
are preferred. Mouse lymphocytes give a higher percentage of stable fusions with the mouse 
myelomas described below. The use of rat, rabbit, frog, sheep and other mammalian somatic 
cells is also possible. The spleen cell chromosomes encoding desired immunoglobulins are 
immortalized by fusing the spleen cells with myeloma cells, generally in the presence of a fusing 
agent such as polyethylene glycol (PEG). Any of a number of myeloma cell lines may be used as 
a fusion partner according to standard techniques; for example, the P3-NSl/l-Ag4-l, P3-x63- 
Ag8.653 or Sp2/0-Agl4 myeloma lines. These myeloma lines are available from the American 
Type Culture Collection (ATCC), 10801 Uni versity Boulevard, Manassas, Va. 201 1 0-2209. 

The resulting ceils, which include the desired hybridomas, are then grown in a selective 
medium, such as HAT medium, in which unfused parental myeloma or lymphocyte cells 
eventually die. Only the hybridoma cells survive and can be grown under limiting dilution 
conditions to obtain isolated clones. The supematants of the hybridomas are screened for the 
presence of antibody of the desired specificity, e.g., by immunoassay techniques such as those 
described herein, using the antigen that has been used for immunization. Positive clones can then 
be subcloned under limiting dilution conditions and the monoclonal antibody produced can be 
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isolated. Various conventional methods exist for isolation and purification of the monoclonal 
antibodies so as to free them from other proteins and other contaminants. Commonly used 
methods for purifying monoclonal antibodies include ammonium sulfate precipitation, ion 
exchange chromatography, and affinity chromatography. Hybridomas produced according to 
these methods can be propagated in vitro or in vivo (in ascites fluid) using techniques known in 
the art (see, generally, Harlow et al., Antibodies. A Laboratory Manual, Cold Spring Harbor 
Laboratory, pp. 1-726, 1988). 

Other embodiments of the instant invention provide methods for detecting DNA, RNA, 
and/or proteins from MTV in biological and/or other types of samples. 

One aspect of this embodiment of the invention provides a method for the detection of 
MTV DNA in a sample which comprises the following steps: 

i) obtaining an sample suspected of containing one or more of the MTV DNA 
sequences described above; 

ii) carrying out a polymerase chain reaction (PGR) to amplify a DNA sequence as 
defined in step i); and, 

iii) determining the sequence of, or otherwise characterizing, the amplicons (the PGR 
amplified DNA) produced in step ii) to determine whether or not the DNA 
sequence as defined in step i) is present in the sample. 

Another aspect of this embodiment provides a method for the detection of MTV RNA in 
a sample which comprises the following steps: 

i) obtaining an sample suspected of containing RNA which encoded by one or more 

of the MTV DNA sequences described above; 
iii) carrying out an RNAse protection assay (RPA); and, 

iii) analyzing the RPA results to determine whether RNA as defined in step i) is 
present in the sample and < ption > ig - lid RNA 

It will be recognized by those of ordinary skill that the selection of the parameters 
necessary for optimizing the steps of these methods are well within the abilities of the ordinarily 
skilled artisan. These parameters, for example PGR conditions (e.g. selection of annealing 
temperature, extension times, and primers) and DNA sequencing method, are routinely 
determined in labs where such molecular biological techniques are employed. 
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Another aspect of this embodiment of the invention provides for a method for analyzing a 
sample in order to determine whether the sample contains antibodies which recognize MTV 
polypeptides. This method comprises the steps of: 

i) obtaining a sample suspected of containing antibodies specific for mammary 
tumor viral antibodies; 

ii) obtaining at least one purified MTV polypeptide; 

iii) performing western immunoblot analysis using the sample of step i) and the 
polypeptide of step ii); and, 

iv) analyzing the results of step iii) to determine whether or not antibodies which 
specifically interact with the peptide of step ii) are present in the sample. 

Another aspect of this embodiment of the invention provides for a method for analyzing a 
ceil culture or a tissue sample by one or more immunohistological methods in order to determine 
whether the sample contains MTV proteins. This method comprises the steps of: 

i) obtaining a sample suspected of containing mammary tumor viral proteins; 

ii) preparing the sample of step i) for immunochemical analysis; 

iii) incubating the sample of step ii) with one or a combination of two or more 
monoclonal or polyclonal antibodies specific for MTV polypeptides encoded by 
one or more of the MTV DNA sequences described above, wherein the said 
antibodies optionally have a moiety which allows for their specific detection; 

iv) washing die samples to remove antibody which is not specifically bound: 

v) processing the samples as appropriate for the selected detection method; and, 

vi) analyzing the results of step v.) to determine whether or not MTV proteins are 
present in the sample. 

It is envisioned that the immunochemical analysis may include, but is not limited to 
analysis by western blotting or enzyme-linked immunosorbant assay. It will be recognized that 
the selection of the parameters necessary for optimizing the performance of these methods are 
within the abilities of the ordinarily skilled artisan. Analysis may be performed using either 
direct immunoehemistry (if the antibodies have been labeled with a detectable moiety) or by 
indirect immunoehemistry. 

Other embodiments of the current invention provide for pharmaceutical compositions 
comprising MTV DNA, RNA and/or proteins described supra. These compositions may further 
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comprise a pharniaceuticaliy acceptable excipient, carrier, or diluent and do not contain any 
biologically harmful substances, The pharmaceutical compositions of the present invention may 
be formulated by one having ordinary skill in the art., Suitable pharmaceutical formulations are 
described in Remington 's Pharmaceutical Sciences which is a standard reference text in the field 
which is here in incorporated by reference. 

The pharmaceutical compositions may further comprise coloring or stabilizing agents, 
osmotic agents, antibacterial agents, or any other substances as long as such substances do not 
interfere with the function of the composition. The pharmaceutical compositions of the 
invention, can, for example, be formulated as a solution, suspension, or emulsion in association 
with a pharmaceutical^ acceptable parenteral vehicle. Examples of such vehicles are water, 
saline, Ringer's solution, dextrose solution, and 5% human albumen. Liposomes may also be 
used. The vehicle may contain additives that maintain isotonicity (e.g., sodium chloride or 
mannitol) and chemical stability (e.g., buffers and preservatives). It should be appreciated that 
endotoxin contamination should be kept at a safe level, for example, less than 0.5 ng/mg protein. 
Moreover, for human administration, preparations should meet sterility, pyrogenicity, general 
safety and purity standards as required by the United States Food and Drug Administration 
Office of Biological Standards. The formulations may be sterilized by commonly used 
techniques such as filtration. 

The phrase "pharniaceuticaliy acceptable" refers to substances and compositions which 
do not produce an adverse, allergic, or otherwise untoward reaction when administered to an 
animal, or a human, as appropriate. A substance which caused produced any of these adverse 
effects would be classified as "biologically harmful" within the scope of the present invention. 
Pharmaceutical!}' acceptable substances and compositions include, but are not limited to 
solvents, dispersion media, coatings, antibacterial and antifungal agents, isotonic and absorption 
delaying agents. Except where incompatible with the invention the use of any conventional 
ingredient is contemplated. Furthermore, supplementary active ingredients which serve some 
other pharmacologically expedient purpose can also be incorporated into the instant 
compositions. 

The methods and compositions described above are contemplated to be of great benefit in 
helping to determine whether or not MTV or MTV viral components are present in a person's 
tissues. This knowledge is would be of aid in predicting a patient's susceptibility to cancer 



WO 01/00829 



PCT/US00/18279 



-17- 

etiologicaliy derived from MTV. The methods provide a means for determining a patients viral 
load (the amount of virus present in the person's tissues). In this respect it is envisioned that 
tremendous benefit would be achieved by adapting the methods herein described to the 
widespread screening of the general population, and in particular to all mature human females. 

Other compositions contemplated as pail of various embodiments of the instant invention 
provide a means for stabilizing or reducing the amount of virus present in a human or other 
animal. 

The various methods described above are readily adaptable to preparing diagnostic kits 
for detecting the presence of MTV DNA, RNA, and/or polypeptides. Consequently, for the 
clinical practice of the invention, yet other embodiments of the instant invention, which provide 
for diagnostic kits, are contemplated. Such kits are useful for the qualitative and quantitative 
analysis of a sample in order to detect, and perhaps determine the quantity of, MTV DNA, RNA, 
and/or polypeptide present therein. In one aspect of this embodiment of the invention the kit 
includes, as part of its components, one or more recombinant DNA molecules comprising one or 
more of the M TV DNA sequences described above. Also according to this aspect of the 
invention the kit may comprise, in addition to (or in alternative to) the recombinant MTV DNA. 
one or more synthetic oligonucleotide primer pairs useful for the PCR amplification of these 
MTV DNA sequences. 

Another aspect of this embodiment of the invention provides for a kit for detecting 
antibodies which specifically recognize mammary tumor virus proteins. As part of its 
components this kit includes a reagent comprising one or more polypeptides encoded by the 
MTV DNA sequences described above. 

Also contemplated are kits useful for detecting the presence of MTV proteins by 
inmiujiC)-.M.^hemi-i _v w in- h ^>mr>ri\c otic or more antibodies (either monoclonal or polyclonal) 
which are specific for at least one MTV polypeptide. Optionally, these antibodies may be 
modified so as to comprise a detection moiety (e.g., a fluorescent die or a radioactive isotope, 
such as 35 S). 

Kits according to these embodiments of the invention may comprise packages, each 
containing one or more of the various reagents (typically in concentrated form} which are 
required to perform the respective diagnostic tests. The kits according to these embodiments of 
the invention are contemplated to be useful for detecting and/or quantifying MTV DNA. RNA, 
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and/or protein in biological (or other types of) samples. Such samples may be selected from, but 
are not limited to the following: blood plasma or serum, whole blood, urine, sputum, colonic 
effluent, cerebrospinal fluid, lymphatic fluid, bone marrow, tissue samples (such as from a 
surgical biopsy), or any other sample suspected of containing these biological molecules. 

The kits also may further include orse or more fiduciary results. As used herein a 
"fiduciary result" refers to a reference standard against which a test outcome is compared to 
gauge the results in terms of quality and/or quantity. A "fiduciary 1 series" is a plurality of such 
references that represent points along a qualitative or a quantitative scale. Preferable in this 
regard, are kits that include a fiduciary series for interpreting results. The fiduciary may be in 
the form of one or more photographs or may be depicted in other ways, including written 
descriptions. 

Thus fiduciaries may be developed as part of the kits, in accordance with this aspect of 
the invention, to guide interpretation of results. In this regard, a RNA or polypeptide 
concentration from a biological sample may be characterized in accordance with the foregoing. 
Sample may be taken from representative cross-section of patients at various stages of disease 
(including asymptomatic or essentially disease free) to prepare a fiduciary series. 
Characterization in this regard may benefit from hindsight, by following the actual course of 
neoplastic progression in patients as they undergo diagnosis, treatment, and follow up thereafter. 
The determination of viral load (as determined by the abundance of MTV DNA, RNA, or 
polypeptide), as set out above, for a variety of patients of known breast cancer status and 
eventual outcome, and the subsequent correlation of these values is of incalculable prognostic 
value. This will help provide the patient with a more accurate prognosis, and it will also aid the 
patient and oncologist in determining the best course of therapeutic treatment, when such 
treatment is necessary. 

Another embodiment of the instant invention provides for a composition which induces 
an immunological response against MTV in an animal. A specifically contemplated aspect of 
this invention is a composition which induces an immunological response to human mammary 
tumor viral protein in humans (see Example 7). It is expected that such induced immunity may 
prevent breast cancer in individuals that carry endogenous MTV, by blocking the spread of the 
virus to honnonaliy sensitive tissues. 
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Methods for producing an immune response to viral DNA and/or RNA in animals are 
also known, see for example U.S. patent numbers 5,990,091 and 6,004,799 which are herein 
incorporated by reference. Thus compositions which elicit an immune response to MTV DNA 
and/or RNA are also contemplated as aspect of this embodiment of the invention. 

Other embodiments provide for therapeutic compositions which decrease the activity and 
presence of human mammary tumor virus. A common feature of all retroviruses is the presence 
of three enzymes involved in various stages of the viral replication cycle, the reverse 
transcriptase CR IT, protease (PR), and integrase (IN). Various pharmaceuticals have been 
developed which target the RT and PR of human immunodeficiency virus (HIV), a retrovirus 
distantl} related to MTV. Furthermore, chemicals which selectively inhibit HIV IN have been 
identified and prototype drags targeting this enzyme are under development (Hong et a!., 1998: 
Mathe, 1999; Robinson, 1998; Singh et al, 2000). According to one aspect of the instant 
invention a pharmaceutical composition comprising one or a mixture of two or more retroviral 
inhibitors in an amount effective to inhibit the activity or the spread of MTV's of the instant 
invention is provided. 

Identification of the drug or drugs for use in the pharmaceutical compositions of this 
embodiment can be made using techniques known to those skilled in the art. For examples of 
protease and reverse transcriptase inhibitors and methods of determining the efficacy of such 
drugs as retroviral inhibitors see U.S. Patent Nos. 5,858,738, 6,017,928, and 6,046,228, which 
are herein incorporated by reference. The various compositions of this embodiment of the 
invention may comprise, for example, currently known HIV integrase. protease, or reverse 
transcriptase inhibitors; alternatively, such inhibitors can be modified so as to increase their 
specificity for MTV's. Likewise, other classes of HIV inhibitors, such as peptides which are 
analogs of the transmembrane glycoprotein, can serve as pro-drugs to develop specific drags 
which decrease activity and presence of MTV in humans and other species. These compositions 
are contemplated as being useful to inhibit the activity and spread of MTV in humans and other 
species which carry these viruses as endogenous genetic, elements. Such drugs could be used to 
treat breast cancer patients afflicted with MTV and would be expected to ameliorate the severity 
and to reduce the recurrence of disease. In addition, such drugs could be used prophylactically to 
prevent the occurrence of individuals at high risk for developing the disease. 
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EXAMPLES 

The following examples are included to demonstrate preferred embodiments of the 
in vention. It should be appreciated by those of skill in the art that the techniques disclosed in the 
examples which follow represent techniques discovered by the inventor to function well in the 
practice of the invention, and thus can be considered to constitute preferred modes for its 
practice. However, those of skill in the art should, in light of the present disclosure, appreciate 
that many changes can be made in the specific embodiments which are disclosed and still obtain 
a like or similar result without departing from the spirit and scope of the invention. 

EXAMPLE 1: MMTV-relatcd sequences in humatis 

Sequences which axe highly similar (>95%) to the MMTV env gene were amplified by 
PGR from human DNA samples, including subsets of both BC (breast cancer) tissue and non-BC 
tissues. The MMTV -related sequences by were found by PGR and a sensitive blotting technique 
not only in breast tumors (see Figure 1), but also in the blood of a subset of healthy controls (see 
Figure 2), and systemic lupus erythematosus (SLE) patients without breast cancer. Our results 
differ from those of Wang and coworkers (1995) who. with few exceptions, were able to detect 
MMTV-iike sequences only in breast tumors. The sequences from human DNA were distinct 
from the MMTV sequences used as controls in these PGR reactions indicating that our results are 
not simply due to contamination. A ribonuclease protection assay was used to confirm these 
results using a non-PCR based technique to determine that the majority of the PGR positive BC 
tissues, but none of the PGR. negative tissues, expressed this sequence at the mRNA level. Many 
of the products from these PGR reactions have been sequenced. Analysis of these sequences 
provides further strong evidence that PGR contamination is an unlikely explanation for the 
observed results. MMT V env-like sequences from different individuals derived in the same PGR 
run were distinct from each other. This result indicates the lack of an ubiquitous PGR 
contaminant that would have produced a more consistent sequence that should have been 
identical (or nearly so) in the various reaction tubes. Furthermore DNA of individual subjects 
produced internally consistent MMTV env-like sequences from PGR run to PGR run. The 
variations within the MMTV-related sequences from a given patient may represent a low number 
of Taq errors, but are also suggestive of variations expected of a replicating retrovirus (reverse 
trascripiase errors). 
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Example 2: RNase Protection Assay determination of MTV RNA levels 

Ribonuciease protection assay (RPA) is a quantitative assay which is frequently used by 
those of ordinary skill in the art to determine the of levels of specific RNA species without PGR 
amplification. Briefly, it is performed as follows for the transcripts of the present invention: a 
probe of uniform length is synthesized by /« vitro transcription of a cloned template and labeled 
to high specific activity. f33l S-labeled riboprobes were prepared from plasmids containing the 
cloned MTV fragments in the 150 to 400 bp size range. The probe was then hybridized to test 
RNA, and RNA fragments that remain unhybridized are not protected from digestion by RNAse 
A/Tl. The fragments of the labeled riboprobe protected by hybridized RNA were visualized and 
analyzed following separation on a denaturing polyacrylamide gel. Levels of MTV-related 
mRNA are compared in each sample to levels of mRNA produced by the p-actin gene, a 
"housekeeping" gene to ensure integrity of the RNA sample (i.e., that the RNA has not been 
degraded). RPA detected RNA in 2 out of 3 PGR positive breast tumors (see Figure 3). RPA is 
10-15 times more sensitive than "northern" analysis for detection of rare messenger RNA. RPA 
analysis is performed directly on total RNA, without any prior manipulation, which can 
introduce errors in the quantitative analysis. 

Example 3: RT-PCR determination of MTV RNA levels 

MTV mRNAs may also be detected using real time RT-PCR (reverse transcriptase linked 
polymerase chain reaction). For example the I-Cycler (Biorad). with a fluoroscopic detection 
facility is capable of determining the real time kinetics of the PGR amplification product by 
quantifying the PGR product in the log-linear phase of the PGR reaction. Using this method, a 
real-time thermocycler can reliably compare miniscule amounts of known nucleotide template in 
a reproducible fashion. 

Example 4: Detection of anti-MTV antibodies in blood by Westers? blot analysis 

The InsectSelect. ,M System (Invitrogen) which allows stable production of recombinant 
proteins in insect cells in a manner similar to we 1 o* xpres on systems 

may be used to generate recombinant MTV proteins. This is a vims-free system which allows 
creation of stable cell lines that continuously produce high-quality protein. Stable cell, lines 
generated in about 9 days may be used for continuous, long-term production of recombinant 
proteins. The InseetSeiect™ expression, system is based on a plasmid vector which carries an 
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antibiotic resistance gene for selecting stably expressing insect cell lines. This expression vector 
uses the immediate early promoter, OpiE2, from the Douglas Fir Tnssoe moth OpMNPV 
baculovirus for gene expression. OpIE2 is a strong transcriptional promoter in lepidopterin (Sf9, 
5121, High Five™ (Invitrogen)) as well as mosquito and dipterin cell lines. The expression 
vector pIZ/V5-His (Invitrogen) has a multiple cloning site and several features that simplify 
production and analysis of recombinant proteins in insect cells, ft includes a Zeocin-resistance 
gene that allows for rapid selection of stably transfected ceils, a C -terminal tag encoding the V5 
epitope, and a C -terminal polybistidine (6xHis) sequence. These latter features facilitate rapid 
protein detection with anti-V5 antibodies (Invitrogen) and protein purification with resins that 
bind the polybistidine. Alternative eukaryotic expression systems for MMTV related proteins 
are an in vitro rabbit reticulocyte lysate system (Promega) with in vitro transcribed and capped 
mRNA or the Sindbis virus Expression system (Invitrogen). 

MMTV -related genes may also be cloned into a pGEX vector (Pharmacia) and expressed 
in bacteria to make recombinant fusion protein with glutathione S-transferase (GST) protein for 
immunoblot studies. This system has several advantages compared to other methodologies of 
protein expression, not the least of which is the ease of production, isolation, and purification of 
the recombinant protein. Fusion proteins generated with the GST protein typically remain 
soluble permitting the recovery from cell iysates. Denaturing conditions are not required during 
purification and therefore the antigenic and enzyme properties of the protein are often 
maintained. Moreover, the GST fusion protein may be efficiently and rapidly purified by 
immobilizing the GST on glutathione coated beads or columns and eluted with reduced 
glutathione. 

To perform a western immunoblot, MTV proteins are dissolved in lysis buffer (0.25 M 
Tris Base, pH 6.8, containing 4% sodium dodecyl sulfate, 10% dithiothreitol, 20% glycerol and 
0.01% w/v bromophenol blue), heated to 100°C for 3 min and subjected to sodium dodecyl 
sulfate-polyacrylamide gel electrophoresis (SDS-PAGE) on a 10% poly aery lamide gel. These 
proteins may then be transferred electrophoretically to a nitrocellulose membrane (Protran™; 
Schleicher & Schuell) in Tris-glycine (pH 8.3) buffer with 20% methanol. The blots may be 
incubated overnight in blocking buffer (0.02 M Tris, 0.1 M NaCl, heat inactivated goat serum, 
0.01% thimerosal, and 5% nonfat dry milk) with serum or plasma from the human or animal 
being tested (1:100 dilution or optimal dilution). Incubation with secondary antibodies, 
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biotinylated anti-human (or other species) IgG goat antibodies diluted 1:500 or 1:1,000 in 
blocking buffer, and with avidin-horseradish peroxidase may be performed at room temperature 
for 2 hours. The immuno blots may be developed with 7.8 mM 4-chloro-l-napthol and 0.03% 
hydrogen peroxide. Bands corresponding to MTV proteins are quantitated by scanning and 
processed with image analysis software (NIH Image). 

Example 5: Detection of anti-MTV antibodies by enzyme linked immunoassay 

Antibodies which specifically bind to MTV protein(s) may be detected using an enzyme- 
linked immunoassay with MTV protein or proteins as the target for the antibodies. In this 
technique MTV proteins produced in an expression system, such as the baculovirus/insect cell 
clutter system or purified from virus preparations may be bound to the bottom of wells in a 
multiwell plastic microtiter plate. Serum or plasma from humans or other species is diluted to an 
empirically determined optimum dilution and incubated from 1 hour to overnight at about 25 °C 
(room temperature). The wells are then washed three times in a saline solution containing 
Tween* 20 (Aldrich) or NP-40 (currently available as Igepal™ CA-630, Sigma) detergents using 
an automated plate washer. Antibodies bound to the MTV proteins may then be detected by 
reacting the wells sequentially with buffers containing biotinylated goat anti-human 
immunoglobulin, followed by avidin coupled to horseradish peroxidase, and finally 3 ,3', 5,5'- 
tetramethyl benzidine a substrate for horseradish peroxidase which produces a colored reaction 
product. Between each step the well is typically washed three times in a saline solution 
containing Tween® 20 or NP-40 detergents using an automated plate washer. The colored 
reaction produced in each well is quantitated using a spectrophotometer plate reader. The 
amount of colored reaction product is proportional to the amount of antibodies to MTV present 
in the original sample. 

Example 6: Detection of MTV retroviral proteins in situ using tmmunohistochemistry 

MTV retroviral proteins may be detected in tissue samples by immunohistochemical 
assays. In this technique MTV proteins produced in an expression system, such as the 
baculovirus/insect ceil clutter system or purified from virus preparations may be used to generate 
monoclonal or polyclonal antibodies by methods well known to those skilled in the art. These 
antibodies may then be labeled with a detection moiety such as biotin, fluorescein, horseradish 
peroxidase or other labels used for this purpose. The labeled antibody preparations are then 
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diluted to an empirically determined optimum dilution and incubated with fixed or frozen thin 
sections or cell preparations from samples to be tested for the presence of MTV proteins. The 
samples are processed, as dictated by the type of label used, and the binding of the antibody to 
MTV retroviral proteins may be detected by microscopy, autoradiography, flow cytometry, etc., 
5 again, as dictated by the identity of the labeling moiety. 

Example 7: Preparation of a pharmaceutical composition capable of eliciting an 

immunological response, against human mammary tumor virus, in humans. 

A pharmaceutical composition to be used to induce an immunological response an in 

humans or animals to MTV protein may be developed as follows. MTV proteins produced in an 

10 expression system, such as the baculovirus/insect cell culture system described above or purified 
from virus preparations may be extensively purified using column chromatography and/or any 
other methodologies commonly employed by skilled artisans. The purified MTV proteins may 
then be mixed or emulsified with a suitable adjuvant preparation (currently alum is the only 
adjuvant approved for use in humans) to produce a vaccine. This immunogenic composition 

is may then be injected subcutaneously or intradermally into the target animal. 

All of the composition and methods disclosed and claimed herein can be made and 
executed without undue experimentation in light of the present disclosure. While the 
compositions and methods of this invention have been described in terms of preferred 
embodiments, it will be apparent to those of skill in the art that variations may be applied to the 

20 compositions and methods and in the steps or in the sequence of steps of the method described 
herein without departing from the concept, spirit and scope of the invention. More specifically, 
it will be apparent that certain agents which are both chemically and physiologically related may 
be substituted for the agents described herein while the same or similar results would be 
achieved. All such similar substitutes and modifications apparent to those skilled in the art are 

?.5 deemed to be within the spirit, scope and concept of the invention as defined by the appended 
claims, 
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1 . An isolated DNA molecule comprising a DNA sequence that has: 

a) at least 99% identity with SEQ ID NO: 2, 3, 4, 5, 7, 8,21, 25, or 27; or 

b) at least 99% identity with one or more of the following: at least 340 contiguous bases 
of SEQ ID NO:?:, at least 150 contiguous nucleotides of the sequence represented by 
nucleotides 1-380 of SEQ I D NO:3, at least 140 contiguous nucleotides of the 
sequence represented by nucleotides 1-400 of SEQ ID NO:4, at least 130 contiguous 
nucleotides of the sequence represented by nucleotides 1-360 of SEQ ID NO:5, at 
least 130 contiguous nucleotides of SEQ ID NO:6, at least 140 contiguous 
nucleotides of the sequence represented by nucleotides 1-380 of SEQ ID NO:7, at 
least 210 contiguous nucleotides of the sequence represented by nucleotides 20-462 
of SEQ ID NO:8, at least 160 contiguous nucleotides of the sequence represented by 
nucleotides 97-462 of SEQ ID NO:21, at least 170 contiguous nucleotides of SEQ ID 
NO:23, at least 100 contiguous nucleotides of the sequence represented by 
nucleotides 337-462 of SEQ ID NO:25, at least 300 contiguous nucleotides of the 
sequence represented by nucleotides 85-462 of SEQ ID NO:27, or at least 200 
contiguous nucleotides of SEQ ID NO:29; or 

c) at least 92% identity with SEQ ID NO: 1 0. 

2. The isolated DNA molecule of claim 1 which comprises SEQ ID NO: 2, 3, 4, 5, 6, 7, 8, 
10, 21, 23, 25, 27, or 29. 

3. The isolated DNA moiecuie of claim 1 wherein the DNA sequence is derived from 
human, rhesus macaque, or cat. 

4. The isolated DNA molecule of claim 1 , further comprising a detection moiety. 

5. The isolated DNA molecule of claim I , which is suspended or dissolved in a diluent. 

6. The isolated DNA moiecuie of claim 5 wherein the diluent is a buffered aqueous 
solution, 
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aic017.The isolated DNA molecule of claim 5 wherein the DNA is present at a concentration of 
from about 0.1 ng/pl to 100 jug/pi 

8. A purified polypeptide encoded by a DNA molecule comprising a DNA sequence that 
has: 

a) at least 99% identity with SEQ ID NO: 2, 3, 4, 5, 7, 8, 21, 25, or 27; or 

b) at least 99% identity with one or more of the following: at least 340 contiguous bases 
of SEQ ID NQ:2, at least 150 contiguous nucleotides of the sequence represented by 
nucleotides 1 -380 of SEQ ID NO:3, at least 140 contiguous nucleotides of the 
sequence represented by nucleotides 1-400 of SEQ ID NO:4, at least 130 contiguous 
nucleotides of the sequence represented by nucleotides 1-360 of SEQ ID NQ:5, at 
least 130 contiguous nucleotides of SEQ ID NO:6, at least 140 contiguous 
nucleotides of the sequence represented by nucleotides 1-380 of SEQ ID NO:7, at 
least 210 contiguous nucleotides of the sequence represented by nucleotides 20-462 
of SEQ ID NO:8, at least 160 contiguous nucleotides of the sequence represented by 
nucleotides 97-462 of SEQ ID NO:2L at least 170 contiguous nucleotides of SEQ ID 
NO:23, at least 100 contiguous nucleotides of the sequence represented by 
nucleotides 337-462 of SEQ ID NO:25, at least 300 contiguous nucleotides of the 
sequence represented by nucleotides 85-462 of SEQ ID NO:27, or at least 200 
contiguous nucleotides of SEQ ID NO:29; or 

c) at least 92% identity with SEQ ID NO: 10. 

9, The purified polypeptide of claim 8 which is SEQ ID NO: 12, 13, 14, 15, 16, 17, 18, 20, 
22, 24, 26, 28, or 30. 

ac03 1 0. The purified polypeptide of claim 8 which is present in a pharmaceutical 

composition. 

II. An antibody which recognizes a polypeptide encoded by a DNA molecule comprising a 
DNA sequence that has: 

a) at least 99% identity with SEQ ID NO: 2, 3, 4, 5, 7, 8, 21, 25, or 27; or 

b) at least 99% identity with one or more of the following: at least 340 contiguous bases 
of SEQ ID NO:2, at least 150 contiguous nucleotides of the sequence represented by 
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nucleotides 1-380 of SEQ ID NO:3, at least 140 contiguous nucleotides of the 
sequence represented by nucleotides 1-400 of SEQ ID NO:4, at least 130 contiguous 
nucleotides of the sequence represented by nucleotides 1-360 of SEQ ID NO: 5, at 
least 130 contiguous nucleotides of SEQ ID NO:6, at least 140 contiguous 
nucleotides of the sequence represented by nucleotides 1-380 of SEQ ID NO:7, at 
least 210 contiguous nucleotides of the sequence represented by nucleotides 20-462 
of SEQ ID NO: 8, at least 160 contiguous nucleotides of the sequence represented by 
nucleotides 97-462 of SEQ ID NO:2L at least 170 contiguous nucleotides of SEQ ID 
NO:23, at. least 100 contiguous nucleotides of the sequence represented by 
nucleotides 337-462 of SEQ ID NO:25, at least 300 contiguous nucleotides of the 
sequence represented by nucleotides 85-462 of SEQ ID NO:27, or at least 200 
contiguous nucleotides of SEQ ID NO:29; or 

c) at least 92% identity with SEQ ID NO: 1 0; 

wherein said, antibody was generated using said polypeptide. 

The antibody of claim 1 1 which is a monoclonal antibody. 

An RNA corresponding to a DNA molecule a DNA molecule comprising a DNA 

sequence that has: 

a) at least 99% identity with SEQ ID NO: 2, 3, 4, 5, 7, 8, 21, 25, or 27; or 

b) at least 99% identity with one or more of the following: at least 340 contiguous bases 
of SEQ ID NO:2. at least 150 contiguous nucleotides of the sequence represented by 
nucleotides 1 -380 of SEQ ID NO:3. at least 140 contiguous nucleotides of the 
sequence represented by nucleotides 1-400 of SEQ ID NO:4, at least 130 contiguous 
nucleotides of the sequence represented by nucleotides 1-360 of SEQ ID NO:5, at 
least 130 contiguous nucleotides of SEQ ID NO:6, at least 140 contiguous 
nucleotides of the sequence represented by nucleotides 1-380 of SEQ ID NO:7, at 
least 210 contiguous nucleotides of the sequence represented by nucleotides 20-462 
of SEQ ID NO:8, at least 160 contiguous nucleotides of the sequence represented by 
nucleotides 97-462 of SEQ ID NO:21, at least 170 contiguous nucleotides of SEQ ID 
3MO:23, at least 100 contiguous nucleotides of the sequence represented by 
nucleotides 337-462 of SEQ ID NO:25, at least 300 contiguous nucleotides of the 
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sequence represented by nucleotides 85-462 of SEQ ID NO:27. or at least 200 
contiguous nucleotides of SEQ ID NO:29; or 

c) at least 92% identity with SEQ ID NO: 10. 

The isolated DNA molecule of claim 1 which is incorporated in a vector; 
wherein said DNA sequence is under the transcriptional control of a heterologous 
promoter. 

The vector of claim 14 which is capable of expressing the DNA sequence in at least, one 
of the following cell types: insect cells, bacterial cells, avian cells, yeast cells, or 
mammalian cells. 

The vector of claim 34 wherein said vector is capable of episomal replication or 
chroroosoi ni :n o •* :u->n in at least one of the following cell types: insect cells, bacterial 
ceils, avian cells, yeast ceils, or mammalian cells. 

A method for the detecting the presence of a mammary tumor viral DNA comprising the 
steps of: 

i) obtaining a biological sample suspected of containing DNA which comprises a 
sequence molecule a DNA molecule comprising a DNA sequence thai has: 

a) at least 99% identity with SEQ ID NO: 2, 3, 4. 5. 7, 8, 21, 25, or 27; or 

b) at least 99% identity with one or more of the following: at least 340 contiguous 
bases of SEQ ID NO:2, at least 150 contiguous nucleotides of the sequence 
represented by nucleotides 1-380 of SEQ ID NO;3, at least 140 contiguous 
nucleotides of the sequence represented by nucleotides 1-400 of SEQ ID NO:4, at 
least 130 contiguous nucleotides of the sequence represented by nucleotides 1-360 
of SEQ ID NO:5, at least 130 contiguous nucleotides of SEQ ID NO:6, at least 
140 contiguous nucleotides of the sequence represented by nucleotides 1-380 of 
SEQ ID NO:7, at least 210 contiguous nucleotides of the sequence represented by 
nucleotides 20-462 of SEQ ID NO;8, at least 160 contiguous nucleotides of the 
sequence represented by nucleotides 97-462 of SEQ ID NO:2 L at least 170 
contiguous nucleotides of SEQ ID NO:23. at least 100 contiguous nucleotides of 
the sequence represented by nucleotides 337-462 of SEQ ID NO:25, at least 300 
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contiguous nucleotides of the sequence represented by nucleotides 85-462 of SEQ 

ID NO:27. or at least 200 contiguous nucleotides of SEQ ID NO:29; or 
c) at least 92% identity with SEQ ID NO : 1 0; 
si) carrying out a polymerase chain reaction to amplify the suspected DNA sequence 

as defined in step i); and, 
Hi) determining the sequence of, or otherwise characterizing, the ampiicons produced 

in step ii) to determine whether or not the DNA sequence as defined in step i) is 

present in the sample. 

The method of claim 17 wherein the biological sample is obtained from human, rhesus 
macaque, or cat. 19. A method of detecting the presence of antibodies which recognize 
one or more mammary tumor virus polypeptides, the method comprising the steps of: 

i) obtaining a sample suspected of containing antibodies specific for mammary 
tumor viral antibodies; 

ii) obtaining at least one purified polypeptide encoded by a DNA molecule 
comprising a DNA sequence that has: 

a) at least 99% identity with SEQ ID NO: 2, 3, 4. 5. 7, 8, 21,25, or 27; or 

b) at least 99% identity with one or more of the following: at least 340 

^.i x v. i i t t i ot i 

sequence represented by nucleotides 1-380 of SEQ ID NO:3, at least 140 
contiguous nucleotides of the sequence represented by nucleotides 1-400 of 
SEQ ID NO:4, at least 130 contiguous nucleotides of the sequence represented 
by nucleotides 1-360 of SEQ IDNO:5, at least 130 contiguous nucleo tides of 
SEQ ID N():6, at least 140 contiguous nucleotides of the sequence represented 
by nucleotides 1-380 of SEQ ID NO:7, at least 210 contiguous nucleotides of 
the sequence represented by nucleotides 20-462 of SEQ ID NO: 8, at least 160 
contiguous nucleotides of the sequence represented by nucleotides 97-462 of 
SEQ ID NO:21, at least 170 contiguous nucleotides of SEQ ID NO:23, at least 
100 contiguous nucleotides of the sequence represented by nucleotides 337- 
462 of SEQ ID NO:25, at least 300 contiguous nucleotides of the sequence 
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represented by nucleotides 85-462 of SEQ ID NO:27, or at least 200 
contiguous nucleotides of SEQ ID NO:29; or 
e) at least 92% identity with SEQ ID NO: 10; 

iii) performing an immunochemical analysis using the sample of step i) and the 

eptide of step ii); and, 

iv) analyzing the results of step iii) to determine whether or not antibodies which 
specifically interact with the peptide of step ii) are present in the sampie. 

The method of claim 19 wherein the immunochemical analysis of step iii) is a Western 
blot analysis, 

The method of claim 39 wherein the immunochemical analysis of step iii) is 
enzyme-linked immunosorbant ass uysis 

The method of claim 19 wherein one or more of the polypeptides are derived from the 
em, or gag, gene and the antibodies are specific for one or more of the following 
polypeptides SEQ ID NO: 12, 13, 14, 15, 16, 17, 18, 20, 22, 24, 26, 28, or 30. 

The method of claim 19 wherein the sample of step ii) comprises antibodies derived from 
a human, rhesus macaque, or cat. 

A diagnostic kit for detecting DNA or RNA from a mammary tumor virus, said kit 
comprising a reagent of a DNA molecule that comprising a DNA sequence that has: 

a) at least 99% identity with SEQ ID NO: 2, 3, 4, 5, 7, 8, 21, 25, or 27; or 

b) at least 99% identity with one or more of the following: at least 340 contiguous bases 
of SEQ ID NO:2, at least 150 contiguous nucleotides of the sequence represented by 
nucleotides 1-380 of SEQ ID NO:3, at least 140 contiguous nucleotides of the 
sequence represented by nucleotides 1-400 of SEQ ID NO:4, at least 130 contiguous 
nucleotides of the sequence represented by nucleotides 1-360 of SEQ ID NO:5, at 
least 130 contiguous nucleotides of SEQ ID NO:6, at least 140 contiguous 
nucleotides of the sequence represented by nucleotides 1-380 of SEQ ID NO: 7, at 
least 210 contiguous nucleotides of the sequence represented by nucleotides 20-462 
of SEQ ID NO: 8, at least 160 contiguous nucleotides of the sequence represented by 
nucleotides 97-462 of SEQ ID NO:21, at least 170 contiguous nucleotides of SEQ ID 
NO:23, at least 100 contiguous nucleotides of the sequence represented by 
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nucleotides 337-462 of SEQ ID NO:25, at least 300 contiguous nucleotides of the 
sequence represented by nucleotides 85-462 of SEQ ID NO:27, or at least 200 
contiguous nucleotides of SEQ ID NO:29: or 
c) at least 92% identity with SEQ ID NO:10. 

A diagnostic kit for detecting antibodies io mammary tumor virus, where said kit 
comprises a reagent comprises one or both of the following: 

i) one or more polypeptides encoded by a DNA molecule comprising a DNA 
sequence that has: 

a) at least 99% identity with SEQ ID NO: 2, 3, 4, 5, 1, 8, 21, 25, or 27; or 

b) at least 99% identity with one or more of the following: at least 340 contiguous 
bases of SEQ ID NO:2, at least 150 contiguous nucleotides of the sequence 
represented by nucleotides 1-380 of SEQ ID NO:3, at least 140 contiguous 
nucleotides of the sequence represented by nucleotides 1-400 of SEQ ID NO:4, at 
least 130 contiguous nucleotides of the sequence represented by nucleotides 1-360 
of SEQ ID NO:5, at least 130 contiguous nucleotides of SEQ ID NG:6, at least 
140 contiguous nucleotides of the sequence represented by nucleotides 1-380 of 
SEQ ID NO:7, at least 210 contiguous nucleotides of the sequence represented by 
nucleotides 20-462 of SEQ ID NQ:8, at least 160 contiguous nucleotides of the 
sequence represented by nucleotides 97-462 of SEQ ID NO:2.1, at least 170 
contiguous nucleotides of SEQ ID NO:23, at least 100 contiguous nucleotides of 
the sequence represented by nucleotides 337-462 of SEQ ID NO:25, at least 300 
contiguous nucleotides of the sequence represented by nucleotides 85-462 of SEQ 
ID NO:27, or at least 200 contiguous nucleotides of SEQ ID NO:29; or 

c) at least 92% identity with SEQ ID NO: 1 0; or 

ii) an antibody specific for a polypeptide according to step i). 

A method for the detection of MTV RNA in a sample which comprises the following 
steps: 

i) obtaining an sample suspected of containing RNA which encoded by one or more 
DNA sequences that have: 

a) at least 99% identity with SEQ ID NO: 2, 3, 4, 5,7,8,21,25, or 27; or 
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b) at least 99% identity with one or more of the following: at least 340 contiguous 
bases of SEQ ID NO:2, at least. 150 contiguous nucleotides of the sequence 
represented by nucleotides 1-380 of SEQ ID NO:3, at least 140 contiguous 
nucleotides of the sequence represented by nucleotides 1-400 of SEQ ID NO:4, at 
least 130 contiguous nucleotides of the sequence represented by nucleotides 1-360 
of SEQ ID NO:5, at least 130 contiguous nucleotides of SEQ ID NO:6, at least 
140 contiguous nucleotides of the sequence represented by nucleotides 1-380 of 
SEQ ID NO:7, at least 210 contiguous nucleotides of the sequence represented by 
nucleotides 20-462 of SEQ ID NO:8, at least 1 60 contiguous nucleotides of the 
sequence represented by nucleotides 97-462 of SEQ ID NO:21, at least 1 70 
contiguous nucleotides of SEQ ID NO:23, at least 100 contiguous nucleotides of 
the sequence represented by nucleotides 337-462 of SEQ ID NO:25, at least 300 
contiguous nucleotides of the sequence represented by nucleotides 85-462 of SEQ 
ID NO:27, or at least 200 contiguous nucleotides of SEQ ID NO:29; or 

c) at least 92% identity with SEQ ID NO: 3 0;; 

iv) carrying out an RMAse protection assay (RPA); and, 

hi) analyzing the RPA results to determine whether RNA as defined in step i) is 
present in the sample and optionally quantitating said RNA. 
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SEQUENCE LISTING 

<110> Garry, Robert F. 
5 <120> Human Endogenous Retrovirus in Breast Cancsr 
<I30> TXJMC : 012 



<170> Patenrin Ver. 2.1 



<210> 1 
<211> 461 
<212> DNA 

<213> Mox < i i tumor a; us 
<40 0:> 1 

tcccttccct cgcctagtgt agatcagtca gatcagatta aaagcaaaaa ggatctattt SO 
ggaaattata ctccccctgt caataaagag gttcatcgat ggtatgaagc aggatgggta 12 0 
gaacctacat ggttctggga aaattctcct aaggatccca atgatagaga ttttactgct ISO 
ctagttcccc atacagaatt gtttcgctta gttgcagcct caagatatct tattctcaaa 24 0 
aggccaggat ttcaagaaca tgacatgatt cctacatctg cctgtgttac ttacccttat 300 
gccatattat taggattacc tcagctaata gatatagaga aaagaggatc tacttttcat 3S0 
atttcctgtt cttcttgtag attgactaat tgtttagatt cttotgccta ogaotatgca 42 0 
gcgatcatag tcaagaggcc gccatacgtg ctgctacctg t 461 



<210> 2 

<211> 461 

<212> DNA 

<213> Human Mammary Turccr Virus 



tcccttccct 


cgcctagtgt 


agatctgtca 


gatcagatta 




ggatctattt 


60 


ggaaattata 


ctccccctgt 


caataaagag 


gttcatcgat 


ggtatgaagc 


aggatgggta 


120 


gaacctacat 


ggttctggga 


aaattctcct 




atgatagaga 


ttttactgct 


180 


ctagttcccc 




gtttcgctta 


gttgcagcct 






240 




ttcaagaaca 






cctgtgttac 




300 


gccatattat 


taggattacc 


tcagctaata 


gatatagaaa 


aaagaggatc 


tacttttcat 


360 


atttcctgtt 


cttcttgtag 


attgactaat 


tgtttagatt 


cttctgccta 


cgactatgca 


420 


gcgatcatag 


;:caagsaggc gccatatgtg 


ctgctacctg 


t 




461 



t 

<400> 3 

tcccttccct cgcctagtgt agatcagtca gatcagatta aaaacaaaaa ggatctattt 6 0 
ggaaattata ctccccctgt caataaagag gttcatcgat ggtatgaagc aggatgggta 120 
gaacctactt ggttctggga aaattctcct aaggatccca atgatagaga ttttactgct 130 
ctagttcccc atacagaatt gtttcgctta gttgcagcct caagacatct tattctcaaa 240 
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aagccaggat ttcaagaaca tgagatgatt. cct.acat.ctg cctgtgttac ttacccttat 300 

gccatattat taggattacc tcagctaata gatatagaga aaagaggatc tacttttcat 360 

atttcctgtt cttcttgttg attgactaat tgtttagatt cttctgccta cgactatgca 420 

gcgatcatag tcaagaggcc gccatacgtg ctgcta cctg t 461 



<210> 4 
<211> 461 
<212> DNA 
<213> Human i 



tcccttccct cgcctag ta i. agaacagtca aatcagatta aaagcaaaaa ggacccaccc 60 

ggaaattata ctccccctgc caataaagag gttcatcgat ggtatgaagc aggatgggta 120 

gaacctactt ggttctggga aaattctcct aaggatccca atgatagaga ttttactgct 18 0 

ctagttcccc at.acagaatt gt.ttcgct.ta gttgcagcct caagacatct tattctcaaa 24 0 

aggccaggat ttcaagaaca tgagatgatt cctacatct.g cctgtgttac ttacccttat 300 

gccatattat taggattacc tcagctaata gatatagaga agagaggatc tacttttcat 360 

atttcctgtt cttcttgtag actgaccaat tgttcagact cttctgccta cgactatgca 420 

gcgatcatag tcaagaggcc gccatacgtg ctgctacctg t 461 



<210> 5 
<211> 461 
<212> DNA 

<213> Rhesus Mammary Tumor Virus 
<400> 5 

tcccttccct cgcctagtgt agatcagtca aatcagatta aaagcaaaaa ggatctattt SO 
ggaaattata ctccccctgt caataaagag gttcatcgat 
gaacctactt ggttctggga aaattctcct aaagatceca atgata; 
ctagttcccc atacagaatt gtttcgctta gttgcagcct caagacatct tattctcaaa 24 0 
aagccaggat ttcaagaaca tgagatgatt cctacatctg cctgtgttac ttacccttat 300 
gccatattat taggattacc tcagctaata gatatagaga aaagaggatc tacttttcat 360 
atttcctgtt cttcttgtag attgactaat tgtttagatt cttctgccta cgactatgca 420 
gcgatcatag tcaagaggcc gccatacgtg ctgctacctg t 461 



<210> 6 
<2il> 461 
<212> DNA 

<2i:i> P.hesc:; Mammary T'jmcr Virus 
<400> 6 

tcccttccct cgcctagtgt agatcagtca aatcagatta aaagcaaaaa ggatctattt 60 
ggaaattata ctccccctgt caataaagag gttcatcgat ggtatgaagc •: 
gaacctactt ggttctggga aaattctcct aaagatceca atgatagaga • 
ctagttcccc atacagaatt gtttcgctta gttgcagcct caagacatct tattctcaaa 240 
aagccaggat ttcaagaaca tgagatgatt cctacatctg cctgtgttac ttacccttat 300 
gccatattat taggattacc tcagctaata gatatagaga aaagaggatc tacttttcat 360 
atttcctgtt cttcttgtag accgactaat tgtttagatt cttctgccta cgactatgca 420 
gcgatcatag tcaagaggcc gccatacgtg ctgctacctg t 461 



aggatgggta 120 

ttactget 180 



55 <210> 7 
<211> 461 
<212> DNA 
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<2±i> Cat Mammary Ttimor Virus 
<400> 7 

tcccttccct cgcctagtgt agaacagtca 

5 ggaasttata ctccccctgt caataaagag 

gaacctacac ggttctgqqa aaattctcct. 

ctagttcccc atacagaatt gtttcgcfcta 

i gga t gasca t ig.at.gatt. 

gccatattat taggattacc tcagctaata 

10 atttcctgtt cttcttgtag attgactaat 

gcgatcatag tcaagaggcc gccatacgtg 



gatcagatta aaagcaaaaa ggat:t:tactt 60 
gttcatcgat ggtatgaagc aggatgggta 12 0 
aaggatccca atgatagaga ttttattgct 180 
gttgcagcct caagacatct tattctcaaa 240 
cct.acat.ctg cctgtgttac ttacccttat 300 
gatatagaga aaagaggatc tacttttcat 36 0 
tgtttagatt cttctgccta cgactatgca 420 
ctgctacctg t 4 61 



15 <211> 461 
<212> DNA 

<213> Cat Mammary Tumor Virus 
c400> 8 

20 tcccttccct cgcctagtgt agaacagtca 
ggaaattata ctccccctgt caataaagag 
gaacctacat ggttctggga aaattctcct 
ctagttcccc atacagaatt gtttcgctta 
aagccaggat ttcaagaaca taagatgatt 

25 gccatattat taagattacc tcagctaata 
atttcctgtt cttcttgtag attgactaat 
gcgatcatag tcaagaggcc gccatacgtg 



gatcagatta aaagcaaaaa ggatctacct GO 
gttcatcgat ggtatgaagc aggatgggta 12 0 
aaggatccca atgatagaga tcttactgct 180 
gttgcagcct caagacatct tattctcaaa 24 0 
cctacatctg cctgtgttac ttacccttat 300 
gatatagaga aaagaggatc tacttttcat 360 
tgtttagatt cttctgccta cgactatgca 42 0 
ctgctacctg t 461 



<210> 9 
<211> 104 
<212> DNA 
<213> Mouse 

<400> 9 

atgafcgccga gaggagaagg gtcagatata ttgatcaagc aattggcatg ggaaaatgca 60 
aattcattgt gtcaggatct. catccgccca atacgaaaaa cagg 104 



<212> DNA 

<213> Cat Mammary Tumor Virus 
<400> 10 

atgat jca a 
aattcattgt gccaagatct tatccgtcca atacgaaaaa cagg 

<210> 11 
<211> 153 



<400> 11 

Ser Leu Pro Ser Pro Ser Val .Asp Gin Ser Asp Gin lie Lys Ser Lys 
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Lys Asp Leu Phe Qly Asn Tyr Thr Pro Pro Val Asn Lys Glu Val His 

20 25 30 

Arg Trp Tyr Glu Ala Gly Trp Val Glu Pro Thr Trp Phe Trp Glu Asn 
35 40 45 

Ser Pro Lys Asp Pro Asn Asp Arg Asp Phe Thr Ala Leu Val Pro His 
50 55 60 

Thr Glu Leu Phe Arg Leu Val Ala Ala Ser Arg Tyr Leu He Leu Lys 
65 70 75 80 

Arg Pro Gly Phe Gin Glu His Asp Met He Pro Thr Ser Ala Cys Val 
85 90 95 

Thr Tyr Pro Tyr Ala He Leu Leu Gly Leu Pro Gin Leu lie Asp lie 



20 Glu Lys Arg Gly Ser Thr Phe His He Ser Cys Ser Ser Cys Arg Leu 

Thr Asn Cys Leu Asp Ser Ser Ala Tyr Asp Tyr Ala Ala He He Val 
13 0 135 14 0 



<211> 153 
<212> PRT 

<213> Human Mammary Tumor Virus 
35 <400> 12 

Ser Leu Pro Ser Pro Ser Val Asp Leu Ser Asp Gin He Lys Ser Lys 

1 5 10 15 

Lys Asp Leu Phe Gly Asn Tyr Thr Pro Pro Val Asn Lys Glu Val His 

40 2 0 25 3 0 

Arg Trp Tyr Glu Ala Gly Trp Val Glu Pro Thr Trp Phe Trp Glu Asn 

35 40 45 

45 Ser Pro Lys Asp Pro Asn Asp Arg Asp Phe Thr Ala Leu Val Pro His 
50 55 60 

Thr Glu .Leu Phe Arg Leu Val Ala Ala Ser Arg Tyr Leu He Leu Lys 



Arg Pro G.I y Phe Gin Glu His Asp Met He Pro Thr Ser Ala Cys Val 
85 SO 95 

Thr Tyr Pro Tyr Ala He Leu Leu Gly Leu Pro Gin Leu He Asp He 

100 105 110 

Glu Lys Arg Gly Ser Thr Phe His He Ser Cys Ser Ser Cys Arg Leu 
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Thr Asn Cys Leu Asp Ser Ser Ala Tyr Asp Tyr Ala Ala lie lie Val 
13 0 13 5 .14 0 

Lys Lys Ala Pro Tyr Val Leu Leu Pro 



10 <210> 13 
<211> 152 
<212> PRT 

<213> Human Mar"~ary Turr.or Virus 
15 <400> 13 

Ser Leu Pro Ser Pro Ser Val Asp Gin Ser Asp Gin lie Lys Asn Lys 

I 5 10 15 

Lys Asp Leu Phe Gly Asn Tyr Thr Pro Pro Val Asn Lys Giu Val His 
20 2 0 2 5 3 0 

Arg Trp Tyr Glu Ala Gly Trp Val Glu Pro Thr Trp Phe Trp Glu Asn 
35 40 45 

25 Ser Pro Lys Asp Pro Asn Asp Arg Asp Phe Thr Ala Leu Val Pro His 
50 55 SO 

Thr Glu Leu Phe Arg Leu Val Ala Ala Ser Arg His Leu lie Leu Lys 

65 70 75 80 

30 

Lys Pro Gly Phe Glr; Glu His Glu Met lie Pro Thr Ser Ala Cys Val 



Thr Tyr Pro Tyr Ala lie Leu Leu Gly Leu Pro Gin Leu lie Asp lie 

100 105 110 

Glu Lys Arg Gly Ser Thr Phe His He Ser Cys Ser Ser Cys Leu Thr 
115 120 12 5 

Asn Cys Leu Asp Ser Ser Ala Tyr Asp Tyr Ala Ala lie lie Val Lys 
130 135 140 

Arg Pro Pro Tyr Val Leu Leu Pre 
14 5 150 



<210> 14 
<211> 153 
<212> PRT 

c2.1i> Human Mammary 



Ser Leu Pro Ser Pro Ser lie Glu Gin Ser Asn Gin He Lys Ser Lys 
15 10 15 

Lys Asp Leu Leu Gly Asn Tyr Thr Pro Pro Val Asn Lys Glu Val His 
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Arg Trp Tyr QIu Ala Oly Trp Val Glu Pro Thr Trp Phe Trp Glu Asn 

3 5 4 0 4 5 

Ser Pro Lys Asp Pro Asn Asp Arg Asp Phe Thr Ala Leu Val Pro His 

50 55 60 

Thr Glu Leu Phe Arg Leu Val Ala Ala Ser Arg His Leu lie Leu Lys 

65 70 75 80 

Arg Pro Gly Phe Gin Glu His Glu Met lie Pro Thr Ser Ala Cys Val 



Thr Tyr Pro Tyr Ala lie Leu Leu Gly Leu Pro Gin Leu lie Asp lie 
100 105 110 

Glu Lys Arg Gly Ser Thr Phe His lie Ser Cys Ser Ser Cys Arg Leu 
115 120 125 

Thr Asn Cys Leu Asp Ser Ser Ala Tyr Asp Tyr Als Ala lie He Val 
130 135 140 



Lys Arg Pro Pro Tyr Val Leu Leu Pro 
145 150 



<210> 15 
<211> 153 
<.212> PRT 

30 <213> Rhesus Mammary Tumor Virus 
<400> 15 

Ser Leu Pro Ser Pro Ser Val Asp Gin Ser Asn Gin He Lys Ser Lys 
1 5 10 15 

35 

Lys Asp Leu Phe Gly Asn Tyr Thr Pro Pro Val Asn Lys Glu Val His 

20 25 30 

Arg Trp Tyr Glu Ala Gly Trp Val Glu Pro Thr Trp Phe Trp Glu Asn 
40 3 5 4 0 4 5 

Ser Pro Lys Asp Pro Asn Asp Arg Asp Phe Thr Ala Leu Val Pro His 
50 55 60 

45 Thr Glu Leu Phe Arg Leu Val Ala Ala Ser Arg His Leu He Leu Lys 
65 70 75 SO 

Lys Pro Gly Phe Gin Glu His Glu Met He Pro Thr Ser Ala Cys Val 



Thr Tyr Pro Tyr Ala He Leu Leu Gly Leu Pro Gin Leu He Asp He 
100 105 110 

Glu Lys Arg Gly Ser Thr Phe Has He Ser Cys Ser Ser Cys Arg Leu 
115 12 0 12 5 

Thr Asn Cys Leu Asp Ser Ser Ala Tyr Asp Tyr Ala Ala He He Val 
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Lys . 
14 5 



Lys Asp Lev Phe Gly Asn Tyr Thr Pro Pro Val Asn Lys Glu Val His 

20 25 30 

Arg Trp Tyr Glu Ala Gly Trp Val Glu Pro Thr Trp Phe Trp Glu Asn 

20 3 5 -10 45 

Ser Pro Lys Asp Pro Asn Asp Arg Asp Phe Thr Ala Leu Val Pro His 

50 55 60 

25 Thr Glu Leu Phe Arg Leu Val Ala Ala Ser Arg His Leu lie Leu Lys 

SS 70 75 80 

Lys Pro Gly Phe Gin Glu His Glu Met lie Pre Thr Ser Ala Cys Val 



Thr Tyr Pro Tyr Ala He Leu Leu Gly Leu Pro Gin Leu lie Asp He 
100 105 110 

Glu Lys Arg Gly Ser Thr Phe His He Ser Cys Ser Ser Cys Arg Pro 



. Asp Ser Ser Ala Tyr Asp Tyr Ala Ala He He Val 
135 14 0 

< Tyr Val Leu Leu Pro 
150 



45 <211> 153 
<212> PRT 

<213> Cat Mammary '. 



• Leu Pro Ser Pro Ser Val Glu Gin Ser Asp Gin He Lys Ser Lys 



Arg Trp Tyr Glu Ala Gly Trp 1 
35 
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Ser Pro Lys Asp Pro Asn Asp Arg Asp Phe Thr Ala Leu Val Pro His 
50 55 SO 

5 Thr Glu Leu Phe Arg Leu Vai Ala Ala Ser Arg His Leu lie Leu Lys 



Lys Pro Gly Phe Gin Glu His Lys Met lie Pro Thr Ser Ala Cys Val 
35 SO 95 

Thr Tyr Pro Tyr Ala lie Leu Leu Arg Leu Pro Gin Leu lie Asp lie 
100 105 110 

Glu Lys Arq Gly Ser Thr Phe His Tie Ser Cys Ser Ser Cys Arg Leu 
115 120 125 

Thr Asn Cys Leu Asp Ser Ser Ala Tyr Asp Tyr Ala Ala lie lie Val 
130 135 140 



Lys Arg Pro Pro Tyr Val Leu Leu Pro 
145 ISO 



<210> 18 
25 <211> 153 
<212> PRT 

<213> Cat. Mammary Tumor Virus 



<400> 13 

30 Ser Leu Pro Ser Pro Ser Val Glu Gin Ser Asp Gin lie Lys Ser Lys 



Lys Asp Leu Leu Gly Asn Tyr Thr Pro Pro Val Asn Lys Glu Val His 

20 25 3 0 

35 

Arg Trp Tyr Glu Ala Gly Trp Val Glu Pro Thr Trp Phe Trp Glu Asn 

Ser Pro Lys Asp Pro Asn Asp Arg Asp Phe Thr Ala Leu Val Pro His 

40 50 55 60 

Thr Glu Leu Phe Arg Leu Val Ala Ala Ser Arg His Leu He Leu Lys 

65 70 75 80 

45 Lys Pro Gly Phe Gin Glu His Lys Met He Pro Thr Ser Ala Cys Val 



Thr Tyr Pro Tyr Ala lie Leu Leu Arg Leu Pro Gin Leu He Asp lie 
100 105 110 

Glu Lys Arg Gly Ser Thr Phe His He Ser Cys Ser Ser Cys Arg Leu 
115 12 0 12 5 

Thr Asn Cys Leu Asp Ser Ser Ala Tyr Asp Tyr Ala Ala lie He Vai 
13 0 ' 135 140 

Lys Arg Pro Pro Tyr Vai Leu Leu Pro 
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<212> PRT 

<213> Mouse mammary tumor virus 
<400> 19 

Met Met Pro Arg Gly Qlu Gly Ser Asp lie Leu lie Lys Gin Leu Ala 

1 5 X0 15 

Trp GIu Lys Cys Lys Phe He Val Ser Gly Ser His Fro Pro Asn Thr 



Lys Asn. Arg 
35 



20 <210> 20 
<211> 35 
<212> PRT 

<213> Cat Mammary Turner Virus 
2.5 <400> 20 

Met Met. Pro Arg Gly Glu Gly Ser Asp He Leu He 



Tyr Lys Lys Cys Lys Phe He Val Pro Arg Ser Tyr Pro Ser Asn Thr 



Lys Asn Arg 



<210> 21 
<211> 460 
<212> DNA 

<213> Human Mammary Tumor Virus 
<400> 21 

tcccttccct cgectagtgt agatcagtca gatcagatta aaagcaaaag gatctatttg 60 
gaaattatac tccccctgtc aataaagagg ttcatcggtg gtatgaagca ggatgggcag 120 
aacctacatg gttctgggaa aattctccta . tttactgetc 180 

tagttcccca tacagaattg tttegcttag ttgcagcctc aagatatctt attctcaaaa 240 
ggecaggatt ccaaggacat gacatgattc ctacatctgc ctgtgttact tacccttatg 300 
ccatattatt aggattacet cagctaatag atatagaaaa aagaggatct acttttcata 36 0 
tttcctgttc ttctt.gt.aaa ttgactaatt gtttagattc ttctgcctac gaetatgeag 42 0 
cgatcatagt caagaaggcg ccatatgtgc tgctacctgt 460 



<211> 38 
<2X2> PRT 

<213> Human Mammary Tumor virus 



<400> 22 
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Ser Leu Pro Ser Pro Ser Val Asp 
1 5 

Arg lie Tyr Leu Glu lie lie Leu 
20 

Gly Met Lys Gin Asp Gly 
35 



Gin Ser Asp Gin lis Lys Ser Lys 
10 15 

Pro Leu Ser lie Lys Phe He. Gly 
25 30 



<210> 23 
<211> 461 
<212> DNA 

<213> Human Mammary Tumor Virus 

is 

<400> 23 

tcccttccct cgcctagrgr agatcagtca 
gqaaattata ctccccctgt caataaaggg 
gagcctacat ggttctggga aaattctcct 
20 ctagttcccc atacagaatt gtctcgctta 
aggccaggat ttcaaggaca tgacatgatt 
gccatattat taggattacc tcagctaata 
acctcctgtt cttcttgtaa attgactaat 
gcgatcatag tcaagaaggc gccatatgtg 



gatcagatta aaagcaaaaa ggatcxattt 60 
gttcatcgat ggtatgaagc aggatgggta 12 0 
aaggatccca atgatagaga ttttactgct 18 0 
gttgcagcct caagatatct tattctcaaa 24 0 
cctacatctg ocr.gtgcu.ao ttacccttat 300 
gafcatagaaa aaagaggatc tacttttcat 360 
ttgtttagat ettctgccfca cgaatatgca 42 0 
ctgctacctg t 461 



<210> 24 
<.211> 153 
<212> PRT 

<213> Human Mammary Tumor Virus 
<400> 24 

Ser Leu Pro Ser Pro Ser Val Asp Leu Ser Asp Gin He Lys Ser Lys 

1 5 10 15 

Lys Asp Leu Phe Gly Asn Tyr Thr Pro Pro Val Asn Lys Glu Val His 

20 25 30 

Arg Trp Tyr Glu Ala Gly Trp Val Glu Pro Thr Trp Phe Trp Glu Asn 
35 40 45 

Ser Pro Lys Asp Pro Asn Asp Arg Asp Phe Thr Ala Leu Val Pro His 
50 55 60 

Thr Glu Leu Phe Arg Leu Val Ala Ala Ser Arg Tyr Leu He Leu Lys 



Arg Pro Gly ?be Gin Glu His Asp Mat He Pro Thr Ser Ala Cys Val 

35 90 95 

Thr Tyr Pro Tyr Ala He Leu Leu Gly Leu Pro Gin Leu He Asp He 

100 105 110 

Glu Lys Arg Gly Ser Thr Phe His He Ser Cys Ser Ser Cys Arg Leu 
115 120 125 



Thr Asn Cys Leu Asp Ser Ser Ala Tyr Asp Tyr Ala Ala He He Val 
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Lys Lys Ala Pro Tyr Val Leu Leu Pro 
145 150 



<2I0> 25 
<211> 461 
<212> DNA 
10 <213> Human Mammary Tumor Virus 

<400> 25 

tcccttccct cgcctagtgt agatcagtca gatcagatta aaagcaaaaa ggatctattt 60 

ggaaattata ctccccctgt caataaagag gttcatcgat ggtatgaagc aggatgggta 120 

is gaacctacat ggttctggga aaattctcct aaggatccca atgatagaga ttttactgct 180 

ctagttcccc atacagaacc gtctcgccta gttgcagcct caagatatct tattctcaaa 240 

aggccaggat ttcaagaaca tgacatgatt cctacacccg eccgugctac ttacccttat 300 

gccatattat taggattacc ccagccaata gatatagaaa aaagaggatc tacttttcat 360 

atttcctgtt cttcttgtag atfcgactaat tgtxraga r: ?: ct: ?;c':g::cr:a egacLarxica 420 

20 gcgatcatag tcaagaaggc gccatatgtg ctgctacctg t 461 



<213> Human Mammary Tumor Virus 
<400> 26 

Ser Leu Pro Ser Pro Ser Val Asp Leu Ser Asp Gin lie Lys Ser Lys 

30 1 5 10 15 

Lys Asp Leu Phe Gly Asn Tyr Thr Pro Pro Val Asn Lys Gly Val His 

20 25 30 

35 Arg Trp Tyr Glu Ala Gly Trp Val Glu Pro Thr Trp Phe Trp Glu Asn 
3 5 40 45 

Ser Pro Lys Asp Pro Asn Asp Arg Asp Phe Thr Ala Leu Val Pro His 

50 " SS 60 

40 

Thr Glu Leu Phe Arg Leu Val Ala Ala Ser Arg Tyr Leu He Leu Lys 



Arg Pro Gly Phe Gin Glu His Asp Met lie Pro Thr Ser Ala Cys Val 
45 85 .90 95 

Thr Tyr Pro Tyr Ala lie Leu Leu Gly Leu Pro Gin Leu He Asp He 
100 105 110 

50 Glu Lys Arg Gly Ser Thr Phe His He Ser Cys Ser Ser Cys Arg Leu 
115 120 125 

Thr Asn Cys Leu Asp Ser Ser Ala Tyr Asp Tyr Ala Ala lie He Val 
13 0 " 13 5 14 0 

55 

Lys Lys Ala Pro Tyr Val Leu Leu Pro 
145 150 
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<210> 27 
<211> 46 0 
5 <212> DblA 

<213> Human Mammary Tumor Virus 

<400> 27 

tcccttccct cgectagtgt agateagtca 

iO ggaaattata ctccccctgt caataaaggg 

gagcctacat ggttctggga aaattctcct 

ctagttcccc aiacagaatt gtttcgctta 

aggccaggat ttcaagaaca tgacatgatt 

gccatattat taggattacc tcagctaata 

15 atttcctgtt cttcttgtag attgactaat 

gcgatcatag tcaagaaggg ccatatgtgc 



gatcagatta aaagcaaaaa ggatctattt 60 
gttcatcgat gg-ar.gaagc aggatgggta 120 
aaggatccca atgatagaga ttttactgct 180 
gttgcagcct caagat at ct tattctcaaa 240 
cctacatctg cctgtgttac tt.accct.tat 300 
gatatagaaa aaagaagatc tacttttcat 360 
ttgtttagat cttccgocca cgaatatgca 420 
tgctacctgt 46 0 



<210> 28 
20 <21X> 152 
<2!2s> PRT 

<213> Human Mammary Tumor Virus 
<400> 28 

25 Ser Leu Pro Ser Pro Ser Val Asp Leu Ser Asp Gin He Lys Ser Lys 
15 10 15 

Lys Asp Leu Phe Gly Asn Tyr Thr Pro Pro Val Asn Lys Glu Val His 
20 25 30 

Arg Trp Tyr Glu Ala Gly Trp Val Glu Pro Thr Trp Phe Trp Glu Asn 
35 40 45 

Ser Pro Lys Asp Pro Asn Asp Arg Asp Phe Thr Ala Leu Val Pro His 
35 50 55 60 

Thr Glu Leu Phe Arg Leu Val Ala Ala Ser Arg Tyr Leu He Leu Lys 
65 70 75 80 

40 Arg Pro Gly Phe Gin Glu His Asp Met He Pro Thr Ser Ala Cys Val 



Thr Tyr Pro Tyr Ala He Leu Leu Gly Leu Pro Gin Leu He Asp He 
100 105 110 

Glu Lys Arg Gly Ser Thr Phe His He Ser Cys Ser Ser Cys Arg Leu 
115 120 125 

Thr Asn Cys Leu Asp Ser Ser Ala Tyr Glu Tyr Ala Ala He He Val 



Lys Lys Gly His Met Cys Cys Leu 
145 150 
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irus 
<400> 29 

ccccctccct cgcctagtgt agatcagtca gatcagatta aaaacaaaaa ggatctactt 

ggaaattata ctccccctgt caataaagag gttcatcgat ggtatgaagc aggatgggta 

gaaectactt gattctggga aaattctcct aaagatccca atgatagaga ttttactgct 

cta.gttc.ccc atacagaatt gttccgctta gttgcagcct caagacatct tattctcaaa 

aagccaggat ttcaagaaga tgacatgatt cctacatctg cctgtgttac ttacccttat 

gccafcattat taggattacc tcagctaata gatatagaga aaagaggatc tacttc-ca- 

atttcctgtt cttcttgtag attgactaar. tigcccagatt cttctgccta cgactatgca 
gcgatcatag tcaagaaggc gccatacgtg ctgctacctg t. 



15 <210> 30 
<21i> 43 
<212> PRT 

<213> Human Mammary Tumor Virus; 

20 <400> 30 

Ser Leu Pro Ser Pro Ser Val Asp 
1 5 

Lys Asp Leu Phe Giy Asn Tyr Thr 
25 2 0 



Gin Ser Asp Gin lie Lys Asn Lys 

10 15 

Pro Pro Val Asn Lys Glu Val His 

25 30 



Arg Trp Tyr Glu Ala Gly Trp Val Glu Pro Thr 
35 40 



